NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Low-rank finetuning for LLMs: A fairness perspective

Das, Saswat; Romanelli, Marco; Tran, Cuong; Kailkhura, Bhavya; Fioretto, Ferdinando (February 2025, AAAI CoLoRA Workshop, 2025)

Free, publicly-accessible full text available February 27, 2026
Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion

Christopher, Jacob K; Cardei, Michael; Bartoldson, Brian R; Kailkhura, Bhavya; Fioretto, Ferdinando (February 2025, Annual Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL))

Free, publicly-accessible full text available February 1, 2026
Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis

Yang, Hongru; Kailkhura, Bhavya; Wang, Zhangyang; Liang, Yingbin (December 2024, Conference on Neural Information Processing Systems (NeurIPS 2024))

Understanding the training dynamics of transformers is important to explain the impressive capabilities behind large language models. In this work, we study the dynamics of training a shallow transformer on a task of recognizing co-occurrence of two designated words. In the literature of studying training dynamics of transformers, several simplifications are commonly adopted such as weight reparameterization, attention linearization, special initialization, and lazy regime. In contrast, we analyze the gradient flow dynamics of simultaneously training three attention matrices and a linear MLP layer from random initialization, and provide a framework of analyzing such dynamics via a coupled dynamical system. We establish near minimum loss and characterize the attention model after training. We discover that gradient flow serves as an inherent mechanism that naturally divide the training process into two phases. In Phase 1, the linear MLP quickly aligns with the two target signals for correct classification, whereas the softmax attention remains almost unchanged. In Phase 2, the attention matrices and the MLP evolve jointly to enlarge the classification margin and reduce the loss to a near minimum value. Technically, we prove a novel property of the gradient flow, termed \textit{automatic balancing of gradients}, which enables the loss values of different samples to decrease almost at the same rate and further facilitates the proof of near minimum training loss. We also conduct experiments to verify our theoretical results.
more » « less
Full Text Available
Training dynamics of transformers to recognize word co-occurrence via gradient flow analysis

Yang, Hongru; Kailkhura, Bhavya; Wang, Zhangyang; Liang, Yingbin (December 2024, Advances in Neural Information Processing Systems (NeurIPS))

Full Text Available
Training Dynamics of Transformers to Recognize Word Co-occurrence via Gradient Flow Analysis

Yang, Hongru; Kailkhura, Bhavya; Wang, Zhangyang; Liang, Yingbin (November 2024, Neural Information Processing Systems (NeurIPS))

Full Text Available
Speculative Diffusion Decoding: Accelerating Language Generation through Diffusion

https://doi.org/10.18653/v1/2025.naacl-long.601

Christopher, Jacob K; Bartoldson, Brian R; Ben-Nun, Tal; Cardei, Michael; Kailkhura, Bhavya; Fioretto, Ferdinando (January 2025, Association for Computational Linguistics (NAACL))

Full Text Available
Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense

https://doi.org/10.18653/v1/2025.naacl-long.623

Ouyang, Yang; Gu, Hengrui; Lin, Shuhang; Hua, Wenyue; Peng, Jie; Kailkhura, Bhavya; Gao, Meijun; Chen, Tianlong; Zhou, Kaixiong (January 2025, Association for Computational Linguistics)

Full Text Available
Leveraging Hierarchical Feature Sharing for Efficient Dataset Condensation

Zheng, Haizhong; Sun, Jiachen; Wu, Shutong; Kailkhura, Bhavya; Mao, Zhuoqing; Xiao, Chaowei; Prakash, Atul (November 2024, Lecture Notes in Computer Science, Springer)

Full Text Available
GTBench: Uncovering the Strategic Reasoning Capabilities of LLMs via Game-Theoretic Evaluations

Duan, Jinhao; Zhang, Renming; Diffenderfer, James; Kailkhura, Bhavya; Sun, Lichao; Stengel-Eskin, Elias; Bansal, Mohit; Chen, Tianlong; Xu, Kaidi (December 2024, Neural Information Processing Systems Foundation, Inc. (NeurIPS))

As Large Language Models (LLMs) are integrated into critical real-world applications, their strategic and logical reasoning abilities are increasingly crucial. This paper evaluates LLMs' reasoning abilities in competitive environments through game-theoretic tasks, e.g., board and card games that require pure logic and strategic reasoning to compete with opponents. We first propose GTBench, a language-driven environment composing 10 widely-recognized tasks, across a comprehensive game taxonomy: complete versus incomplete information, dynamic versus static, and probabilistic versus deterministic scenarios. Then, we (1) Characterize the game-theoretic reasoning of LLMs; and (2) Perform LLM-vs.-LLM competitions as reasoning evaluation. We observe that (1) LLMs have distinct behaviors regarding various gaming scenarios; for example, LLMs fail in complete and deterministic games yet they are competitive in probabilistic gaming scenarios; (2) Most open-source LLMs, e.g., CodeLlama-34b-Instruct and Llama-2-70b-chat, are less competitive than commercial LLMs, e.g., GPT-4, in complex games, yet the recently released Llama-3-70b-Instruct makes up for this shortcoming. In addition, code-pretraining greatly benefits strategic reasoning, while advanced reasoning methods such as Chain-of-Thought (CoT) and Tree-of-Thought (ToT) do not always help. We further characterize the game-theoretic properties of LLMs, such as equilibrium and Pareto Efficiency in repeated games. Detailed error profiles are provided for a better understanding of LLMs' behavior. We hope our research provides standardized protocols and serves as a foundation to spur further explorations in the strategic reasoning of LLMs.
more » « less
Full Text Available
Transformers Can Do Arithmetic with the Right Embeddings

McLeish, Sean; Bansal, Arpit; Stein, Alex; Jain, Neel; Kirchenbauer, John; Bartoldson, Brian R; Kailkhura, Bhavya; Bhatele, Abhinav; Geiping, Jonas; Schwarzschild, Avi; et al (December 2024, ArXiv)

Full Text Available

« Prev Next »

Search for: All records